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A quantitative description of the qualitative feature of multihadron final states known as the 
"number of jets" is given by a sequence of infrared finite shape observables (jet discriminators) that: 
take continuous values between and 1; are stable — unlike clustering algorithms — against small 
variations of the input (data errors, Sudakov effects etc.); have a form of multiparticle correlators 
that is natural in the context of quantum field theory and hence are better suited for a systematic 
study of theoretical uncertainties (logarithmic and power corrections). 



1. The jet paradigm is the foundation of the high-energy 
collider physics'. It is based on the experimental evidence for 
hadronic jets 2 and the Quantum Chromodynamics-based pic- 
ture of hadronic energy flow inheriting the shape of partonic 
energy flow in the underlying hard process 3 . However, the 
problem of adequate numerical description of multijet structure 
of multihadron events proved theoretically subtle, its apparent 
simplicity turned out deceptive, while its satisfactory solution, 
elusive. The fundamental role of calorimetric measurements in 
high-energy experiments warrants a scrutiny of the logical 
principles of such measurements. 

2. It makes sense to divide the problems where jets are 
studied into two classes. The descriptive theory of hadronic 
jets studies the dynamics of jets as such 4 ; one is mostly inter- 
ested in qualitative effects that occur in the leading logarithmic 
order; a systematic improvement of theoretical predictions is, 
typically, hardly possible. 5 

The second class (precision measurements) comprises 
quantitative studies of the Standard Model (determination of 
&s(Q 2 ) ~> 00 etc. 1 ) where one aims at a highest reliability for 
both data and theoretical predictions: 

Reliability of data means that the problem ought to be re- 
garded as the one of measurement rather than the one of mod- 
eling dynamics. One has to ensure that measurements be stable 
with respect to errors in data from calorimeter cells, their posi- 
tion and geometry, etc. (otherwise physical information may be 
distorted by artefacts of measurement procedures), and that the 
data experimentalists produce be not biased by the imperfect 
knowledge of details of dynamics. 

Reliability of theoretical predictions means that it ought to 
be possible to systematically include logarithmic and power 
corrections. The observables one uses ought to conform to the 
general structure of the underlying formalism (perturbative 
Quantum Field Theory) to ensure a better control over theore- 
tical uncertainties due to a considerable sophistication of the 
modern analytical methods of the theory of Feynman diagrams 6 . 

3. Jet counting is an attempt to use jets of hadrons to tag 
events. Its great usefulness 7 is due to the fact that the very 
presence of jets and their number is the most direct and clear 
manifestation of the dynamics of QCD. 

The conventional jet counting determines an integer number 
of jets for each event using algorithms 8 which attempt to recon- 
struct the underlying partons' momenta by, in effect, inverting 
the hadronization. They were invented 9 in the context of de- 
scriptive theory of jets, involve many ambiguities 8 , and their 
use in measurement-type problems may not be accepted un- 
critically: 

On the theory side, the definition of jets in such algorithms uses 
phase space cutoffs to take into account cancellations of IR singula- 
rities. This is rather unnatural within the formalism of QFT: one 
has to recur to numerics even in simpler cases 10 whereas a study 
of power corrections remains practically impossible. 



On the measurement side, any algorithm that produces an 
integer number of jets cannot be fully satisfactory — even be- 
fore any dynamics gets involved. Indeed, such an algorithm 
rips the continuum of multiparticle states by mapping it to the 
discrete set of natural numbers. A discontinuous mapping is 
unstable with respect to small variations (measurement errors 
or unknown higher order corrections) for some values of input 
data (cf. Fig. I). 11 As a result, the inversion of hadronization is 
a mathematically ill-posed problem, whence the problem of 
spurious jets, and the sensitivity to Sudakov effects and to irre- 
levant details of recombination procedures. 8 This pathology is 
somewhat masked off by averaging over many events. But a 
deterministic recombination algorithm is applied separately to 
each stochastically generated event. So, instead of a statistical 
compensation of errors, there occurs a smearing between cross 
sections with adjacent numbers of jets. 12 It can be eliminated 
neither by increasing statistics, nor by varying the jet resolution 
y cut , and it is more important for smaller y cul , lower energies 
and larger numbers of jets. 13 

What, then, could be a quantitative measure for the qualita- 
tive feature of multiparticle final states known as the "number 
of jets", a measure that allows a correct handling of data errors 
and a systematic study of theoretical uncertainties, and that is 
unbiased by the imperfect knowledge of jet dynamics? 

4. Mathematical nature of "energy flow". If co is a calo- 
rimeter cell, then the energy deposited in it by particles that hit 
the cell is E(co) > . Energy conservation implies that if one 
takes two non-overlapping cells co and co' and combines them 
into one, then the energy deposited in it is the sum of energies 
deposited in co and co' separately: E(co u co') = E(co) + E(co') . 
One can consider cells co simply as parts (subsets) of the unit 
sphere around the collision point. Then the energy flow (EF for 
short) is a non-negative additive function on the subsets CO. 
Such functions are known as abstract measures. • 

Let P be a multiparticle state, P = {£,,/?,}, , where and 
pj are the energy and direction (a unit 3-vector) of the i'-th parti- 
cle. All information about P obtainable using calorimeters is its EF 
represented as a linear combination of 5-functions localized at p, , 

Epift^E.Sp.ip) , (1) 

where p is a variable unit 3-vector running over the sphere. 
The energy measured by a cell co is 

*,<®) = J B dp«,<P) = I Ae(B £i • 

The observables we deal with in calorimetric measurements 
are functions of EFs E. Let f(E) be such a function. Its stabi- 
lity with respect to data errors translates into a concrete kind of 
continuity. Let E n be a sequence of EFs such that, however 
small the energy resolution and geometry of calorimeters, E„ 
become indistinguishable within data errors for all n large 
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enough. This calorimetric or C-convergence of EFs is for- 
malized as follows. Let < (p(p) < 1 be a continuous function 
on the unit sphere. It can be thought of as describing local effi- 
ciency of a calorimeter cell: for a given EF E(p) the ex- 
pression ]ApE{p)(p{p) = (E(p) is the energy measured by this 
cell. Then C-convergence of E n is equivalent to numerical 
convergence of (E n <p) for any "detector" (p. 16 For a correctly 
defined observable /, f(E„) should converge in numerical 
sense for any such sequence E n . Such functions / {calorime- 
tric, or C-observables) are exactly the ones that are stable with 
respect to measurement errors of calorimetric detectors. 

5. The role of C-continuity is best understood with the help 
of analogy with the familiar length measurements. Length is 
habitually represented as a real number — an idealization that, 
one tends to forget, is highly non-trivial from a historical per- 
spective. In particular, the familiar continuity of real numbers 
is useful only inasmuch as it corresponds to the stability of, 
say, volume computations with respect to data errors of the 
length measurements involved. The elusive reality of calorime- 
tric measurements is that whereas rulers measure length as a 
real number, calorimetric detectors measure energy flow as an 
additive function on subsets, and the C-continuity plays exactly 
the same role for data errors of calorimeters as the usual conti- 
nuity of real numbers does for rulers. 

6. A large class of C-observables is immediately found as 
follows. 17 Consider the direct product of m identical EFs 
E(p) . Then the standard theorems 14 imply C-continuity of the 
observables of the following form: 

F m (E) = jd Pl ...jd Pm E(p 1 )...E(p m )f m (Pu-J m ) . (2) 
where f m is any continuous symmetric function. A function on 
EFs induces a function on multiparticle states: using (1) and 
(2) one obtains: 

fMM^ ... im \- E iMK pO ■ (3) 

This is automatically fragmentation invariant. If f m satisfy 
minimal requirements of regularity (e.g. existence of first de- 
rivatives), then F m are IR finite 18 . Such C-observables are in- 
terpreted as average values of operators that are m-local in 
momentum space, which offers a possibility of their systematic 
theoretical study. 19 Examples of C-observables are the well- 
known thrust, spherocity, etc. (see Ref. 4 for a complete list). 20 
Algebraic combinations of C-observables are again C- 
observables. But taking e.g. infinite sums of such functions 
( m — > +oo ) requires care: one can arrive at observables that are 
IR safe in each order of perturbation theory, continuous in the 
ordinary sense as functions of particles' energies and momenta 
for any fixed number of particles, but not C-continuous. 21 A 
complete understanding of this subtlety in a general QFT con- 
text is lacking. Anyhow, C-continuity limits available options, 
and if one wishes to deal with correlator-type observables then 
the above F m remain the only choice. 

7. Measuring the "number of jets". Imagine a step func- 
tion equal to on states with less than m jets, and to 1 else- 
where. A sequence of such functions (m = 1,2, . . .) would do the 
job of jet counting just fine. But we wish to deal with C-obser- 
vables. So, consider a C-observable (3) that is exactly on any 
state with less than m particles. Then f m {p,p,...) = , so that 
/m(Pl>P2>-") should contain anullifying factor A 12 , e.g.: 

A n = l-cos6> 12 = ( Pl p 2 )( Pl Pr l (p 2 Pr l , (4) 



where p, are light-like 4-momenta ( pf = £, , p, = E t pi ) and 

the 4-vector P 2 = 1 describes the reference frame 22 . Symmetry 
yields a similar factor for each pair of arguments. One gets the 
sequence of jet discriminators: 

J„(E 1 ,p 1 ,...,E N ,p N )= Y, E , l --- E , m jm(P, r ---'PiJ' 

\<U<...<i<N 



j m (pu-,p m )=N m n v 

!<('< j<m 



(5) 



It turns out that < /,„ < 1 on any state if N m is defined from 
the condition J m {P^ m ) = 1 where P^ 5 ™ is a limit of uniformly 
distributed states P^ ym with TV — » °° so that E, = N- 1 and 

Then ^ 2=2 ' ^3=27/4, JV 4 =36, 
N 5 = 9375 / 32 , N 6 = 455625 / 128 . 23 

Some special values of J m (P) are as follows. For the state 
P 3 sym consisting of 3 symmetrically arranged particles, 
7 3 (P 3 ym ) = 27/ 32 = 0.84 . For a symmetric state of 4 particles 
(tetrahedron) J 3 (P 4 sym ) = 1 , J 4 (P 4 sym ) = 64 / 8 1 . For a sym- 
metric state of six particles (octahedron), 
/ 3 (P 6 sym ) = / 4 (P 6 sym ) = l. 

1 



^cuf 
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Fig. 1 . The crosses are the values of the jet discriminators J m for a typical 
final state. When looked at sideways, the fat lines represent the "number of 
jets" as a function of y cut . 

Fig. 1 shows a typical picture of values of jet discriminators. 
The usual jet counting amounts to replacing the continuous /,„ 
with or 1 (the blobs). This can be achieved e.g. by introduc- 
ing a cutoff y cut as shown. 24 The non-zero tail at large m is 
due to hadronization (including Sudakov effects). The instabi- 
lity with respect to such effects as well as to data errors (shifts 
of the crosses) is clearly seen. 25 

Note that J m = for m larger than the number of particles 
(or detector cells). For decreasing width of jets and for 
m>M (a typical number of jets in the event), /,„ are in- 
creasingly suppressed by powers of @ 2 and of the energy frac- 
tions of soft particles. This ensures a monotonic decrease of the 
values of /,„ for m>M for typical events. Numerical experi- 
ments show that the decrease of /,„ is a universal feature even 

for m<M . 23 - 26 

Fragmentation causes the values of J m to increase as com- 
pared with the parton state. However, the C-continuity of J m 
ensures that the closer (in the calorimetric sense) the final had- 
ron state to the parton state, the less the difference in values of 
C-observables, and the less the upward shift of J m . 

8. For the case of hadrons in the initial state one should 
modify J m to suppress contributions from the hadron beams. 
For pp , say, it is sufficient to introduce into j m the factor 
1 - cos 2 dj per each particle where 0, is the angle between the 
particle's direction and the beam axis. 
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9. Now, fix a multiparticle state P and consider any jet 
counting algorithm A that produces an integer number of jets 
N A (y cut ;P) for each y cut , which is non-decreasing as 
y™t — > • Then from Fig. 1 one sees that one could, in theory, 
restore a sequence of jet discriminators J$(P) similar to 
J m (P) ■ Thus, the information content of J,£(P) and 
N A (y cut ;P) is essentially equivalent. But it is hardly possible 
to find meaningful expressions for J,$(P) for the popular al- 
gorithms. Our J m (P) are singled out by the transparency of 
analytical structure. 

10. So, studying the average values of jet discriminators 
(J m ) (qualitatively interpreted as fractions of events with no 

less than m jets) instead of the usual «-jet fractions may have 
an advantage of reducing, in perspective, both theoretical and 
experimental uncertainties: 

To compute (J m ) from data, one would treat each calorime- 
ter cell as a particle (the correctness of this is ensured by C- 
continuity). Computations can be optimized due to the regular 
structure of /,„ in several ways: (i) One can do the summa- 
tions a la Monte-Carlo with probabilities equal to energy frac- 
tions, (ii) A preclustering can be used due to C-continuity to 
reduce the number of particles to, say, < 30 when computa- 
tions are easily manageable; since the exact expression is 
known, the approximation errors are fully under control here, 
(iii) The computation of (5) can be parallelized. 

On the theoretical side, studying effects of hadronization 
would reduce to studying logarithmic and power corrections to 
(J m ) . Resummation of logarithms is done via the standard 

renormalization group. The analytical calculations of the cor- 
responding diagrams are easier due to the simple analytical 
form of the weights in the phase space integrals (cf. (4)). Also, 
a prospect opens for a study of power corrections 27 . Recall that 
the power corrections for a tot (e + e~ —> hadrons) oc J 2 are 
given by expressions involving vacuum condensates 28 that are 
directly related to soft singularities; the structure of power cor- 
rections can be obtained within perturbation theory 29 while the 
values of condensates are estimated via the lattice QCD. A 
similar approach should be feasible for the jet discriminators. 30 
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pressed upon me by S. D. Ellis. A crucial encouragement came 
from A. V. Radyushkin. I thank Z. Kunszt and ETH for the 
hospitality during the Workshop on New Techniques for Cal- 
culating Higher Order QCD Corrections (ETH, Zurich, De- 
cember 1992) — its atmosphere catalyzed the present work. 
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ergy (grant DE-FG02-90ER-40577) and by the International 
Science Foundation (grant MP9000). 



On leave of absence from the Institute for Nuclear Research of Rus- 
sian Academy of Sciences, Moscow 1 17312, Russia. 

['] For a review see S. D. Ellis, "Lectures on perturbative QCD, jets and 
the Standard Model: collider phenomenology" at the 1987 Theoretical 
Advanced Study Institute (1987, July, St. John's College, Santa Fe, 
NM), available as technical report NSF-ITP-88-55. 

[ 2 ] G. Hanson et al., Phys. Rev. Lett. 35, 1609 (1975). 

[ 3 ] G. Sterman and S. Weinberg, Phys. Rev. Lett. 39, 1436 (1977). 



[ 4 ] R. Barlow, Rep. Prog. Phys. 36, 1067 (1993). 

[ 5 ] For instance, one needs to identify jet axes to study the QCD coher- 
ence (Yu. L. Dokshitzer et al., Rev. Mod. Phys. 60, 373 (1988)), but 
the notion of jet's axis is ambiguous beyond the leading order of per- 
turbation theory. 

["] Cf. F. V. Tkachov, in Ref. 7. 

[ 7 ] Cf. Proc. New Techniques for Calculating Higher Order QCD Cor- 
rections (ETH, Zurich, 16-18 December 1992), ed. Z. Kunszt (ETH, 
Zurich, 1992). 

[ 8 ] For a review see S. Bethke et al., Nucl. Phys. B370, 310 (1992) and 
S. Catani, in: Proc. 17th INFN Eloisatron Project Workshop, Erice, 
1991, ed. L. Cifarelli and Yu. L. Dokshitzer (Plenum Press, New 
York). 

[ 9 ] T. Sjostrand, Comp. Phys. Commun. 28, 229 (1983); see also Ref. 8. 

[ 10 ] Cf. W. T. Giele and E. W. N. Glover, Phys. Rev. D46, 198 (1991). 

["] A more stable variant of recombination was described in S. Youssef, 
Comp. Phys. Commun. 45, 423 (1987). 

[ 12 ] Cross sections for adjacent numbers of jets differ by 0(a s ) , so 1% of 
3-jet events identified as having 4 jets (due to data errors or incom- 
plete knowledge of hadronization) means an 0( 10%) error for the 4- 
jet cross section. 

[ 13 ] Multijet channels can be used e.g. for top search (F. A. Berends et al., 
Nucl. Phys. B357, 32 (1991)). On the other hand, the study of multi- 
jet cross sections by UA2 (K. Jacobs, in "Joint Int. Lepton-Photon 
Symp. on HEP" (1991, Geneva) World Scientific: Singapore, 1992) 
concluded that the agreement of the data with theory they found for 4— 
6 jets "can be considered as largely accidental". 

[ 14 ] For a thorough treatment see L. Schwartz, Analyse Mathematique, 
vol. 1 (Hermann, Paris, 1967). 

[ 15 ] The word "measure" is used here in two different meanings, physical 
and mathematical, not to be confused. 

[ I6 ] Note that all possible EFs form an infinitely-dimensional space, and 
there are many radically non-equivalent ways to define convergence 
(i.e. topology) in such spaces (see any textbook on functional analy- 
sis). To appreciate the subtlety of the problem recall the large-scale 
study of G. C. Fox and S. Wolfram, Nucl. Phys. B149, 413 (1979), 
who adopted an incorrect idealization of EFs as functions on the unit 
sphere with L 2 topology (familiar from quantum mechanics) and the 
research was lead astray towards studying spherical harmonics etc. 
Our C-convergence is the well-known weak convergence of linear 
functionals. It cannot be described in terms of a single-valued distance 
function or norm (cf. Ref. 14). This may be psychologically uncom- 
fortable but such is the nature of calorimetric measurements. 

[ 17 ] Cf. the cumbersome constructions of Fox and Wolfram, in Ref. 16. 

[ 18 ] G. Sterman, Phys. Rev. D19, 3135 (1979). 

["] Cf. F. R. Ore and G. Sterman, Nucl. Phys. B165, 93 (1980). 

[ 20 ] Note that the energy correlations of C. C. Basham et al., Phys. Rev. 
Lett. 41, 1585 (1978) are a special case of F m — but with a discon- 
tinuous f . Since hadronization is not inverted here, the lack of C- 
continuity is numerically less important than with recombination algo- 
rithms. 

[ 21 ] F. V. Tkachov, in "Joint International Workshop on High Energy 
Physics" (September 1993, Zvenigorod, Russia), ed. 
B. B. Levtchenko (INP MSU: Moscow, 1994), p. 80. 

[ 22 ] For e + e~ this is the total 4-momentum. For ep one may wish to 
choose a different frame — as described e.g. by B. Webber, J. Phys. 
G19, 1567 (1993). 

[ 23 ] I thank B. B. Levtchenko for numerical checks of this. 

[ 24 ] Such a jet counting procedure was called Moscow sieve in 

F.V. Tkachov, technical report, INR-F5T/93-01 (unpublished). 

[ 25 ] The figure suggests that the importance of Sudakov effects in the con- 
ventional recombination algorithms is an artefact due to the instability 
of the latter. 

[ 26 ] S. Catani et al., Nucl. Phys. B406, 187 (1993) discuss relationship 
between jet clustering algorithms and event shape measures, including 
the monotonicity. 

[ 27 ] whose importance is discussed by B. W. Webber, in Ref. 7. 

[ 28 ] M. A. Shifman et al, Nucl. Phys. B147, 448 (1979). 

[ 29 ] F. V. Tkachov, Phys. Lett. B125, 85 (1983). 

[ 30 ] The required mathematics is reviewed in Ref. 6. 



